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Spatial congregation of STAT binding directs selective 
nuclear architecture during T-cell functional 
differentiation 
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Higher-order genome organization shows tissue-specific patterns. However, functional relevance and the mechanisms 
shaping the genome architecture are poorly understood. Here we report a profound shift from promiscuous to highly 
selective genome organization that accompanies the effector lineage choice of differentiating T cells. As multipotent naive 
cells receive antigenic signals and commit to a T helper (Th) pathway, the genome-wide contacts of a lineage-specific 
cytokine locus are preferentially enriched for functionally relevant genes. Despite the establishment of divergent inter- 
actomes and global reprogramming of transcription in Thl versus Th2, the overall expression status of the contact genes is 
surprisingly similar between the two lineages. Importantly, during differentiation, the genomic contacts are retained and 
strengthened precisely at DNA binding sites of the specific lineage-determining STAT transcription factor. In cells from 
the specific STAT knock-out mouse, the signature cytokine locus is unable to shed the promiscuous contacts established in 
the naive T cells, indicating the importance of genomic STAT binding. Altogether, the global aggregation of STAT 
binding loci from genie and nongenic regions highlights a new role for differentiation-promoting transcription factors in 
direct specification of higher-order nuclear architecture through interacting with regulatory regions. Such subnuclear 
environments have significant implications for efficient functioning of the mature effector lymphocytes. 

[Supplemental material is available for this article.] 



Differentiation is a progressive process in which a progenitor cell 
is altered to become specialized for its physiological function. 
Contextual input can stimulate appropriate cell differentiation 
pathways that endow the cells with characteristic morphology and 
phenotype. This cellular specialization reflects reprogramming of 
gene expression orchestrated by several mechanisms, including 
actions of fate-determining transcription factors, modification of 
chromatin structure, and DNA methylation (Aune et al. 2009; 
Kanno et al. 201 1). Importantly, the dynamic gene reprogramming 
occurs in the context of a spatially organized nucleus (Joffe et al. 
2010). A potential role of nuclear higher-order organization in 
regulating cell-specific transcription is suggested by fluorescence 
in situ hybridization (FISH) observations showing that some 
functionally important genes reposition during lymphocyte dif- 
ferentiation in a manner correlated with expression (Brown et al. 
1999; Kosak et al. 2002; Hewitt et al. 2004; Kim et al. 2004; 
Spilianakis et al. 2005; Joffe et al. 2010). However the functional 
role of this positioning is not clear. For example, while silent 
genes tend to reside in inactive subnuclear environments such as 
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heterochromatic regions at nuclear periphery or pericentromeric 
loci, some activated genes remain close to the periphery, pre- 
cluding a simple model (Hewitt et al. 2004). Moreover, little is 
known about the underlying mechanisms that establish the 
nonrandom nuclear three-dimensional (3D) architecture during 
differentiation, and how the organization relates to the cell func- 
tion. Understanding the role of nuclear architecture in regulating 
the transcriptional program would require comprehensive explo- 
ration of the dynamics of chromatin interactions. 

Differentiation of multipotent naive CD4 + T cells to mature 
effector lymphocytes is critical for proper adaptive immune 
responses. Activation of naive CD4 + T cells from the initial antigen 
encounter induces specific differentiation into T helper (Th) cells 
that mount an appropriate phenotype of immune response to the 
particular pathogen, such as Thl, Th2, and Thl 7 (Murphy and 
Reiner 2002; Zhu et al. 2010; Nakayamada et al. 2012). A hallmark of 
Thl/Th2 distinction is the production and secretion of lineage- 
specific cytokines by differentiated T cells, interferon gamma (IFNG) 
for Thl and interleukin 4 (IL-4) for Th2 (Wilson et al. 2009; 
Balasubramani et al. 2010; Zhu et al. 2010; Kanno et al. 2011). The 
commitment of naive cells to either Thl or Th2 is modulated by the 
differentiation signals transmitted through members of the signal 
transducer and activator of the transcription (STAT) protein family 
(Adamson et al. 2009; Zhu et al. 2010). The early-acting STAT pro- 
teins are critical for Th lineage specification, whereas transcription 
factors such as NFKB, AP-1, and NFAT respond to T-cell receptor 
(TCR) signaling that occurs universally during differentiation for all 
Th lineages (Isakov and Altman 2002). These transcription factors 
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modify the gene expression program to direct the proper course of 
differentiation and enforce the lineage-specific function. Nota- 
bly, accumulating studies document abundance of nongenic bind- 
ing sites for most transcription factors, suggesting that the spatial 
encounters between genomic loci are important for their genome- 
wide effects (Biddie et al. 2010; Hakim et al. 2010; Wei et al. 2010; 
John et al. 2011). 

In this study we report a striking global reorganization of the 
nuclear architecture that occurs in naive T cells as they take on one 
of the two distinct fates, Thl or Th2. We provide genetic and 
molecular evidence that the lineage-inducing transcription fac- 
tors, STAT proteins, play a critical role in specifying such a func- 
tional nuclear architecture through their direct association with 
genie and nongenic loci. 

Results 

Lineage-specific interchromosomal association of key genes 
for Th cell function 

To determine the functional features of the Thl/Th2-specific 
nuclear architecture, we first examined whether genes critical for 
Th cell function associate preferentially in the relevant Th lineage, 
by 3D DNA FISH. Thl cells express high levels of Ifhg as well as 



Furin, encoding a Thl -specific proprotein convertase that en- 
hances IFNG secretion (Pesu et al. 2006). We found that Ifhg 
(chromosome 10) and Furin (chromosome 7) are more proximal in 
Thl cells in comparison to Th2 cells where the two genes are si- 
lenced (Fig. 1A). To investigate whether there is a similar proximal 
localization of functionally related genes in Th2 cells, we analyzed 
the spatial positioning of the cluster of Th2-specific interleukin 
genes (114, 115, and 1113) and Gata3, the gene encoding GATA 
binding protein 3, the master transcriptional regulator for the Th2 
lineage. Indeed, the Th2 cytokine locus (chromosome 11) is pref- 
erentially associated with Gata3 (chromosome 2) in Th2 cells (Fig. 
IB). These cell type-specific differences in interchromosomal as- 
sociation were modest but intriguing patterns to warrant a com- 
prehensive characterization of the full environment of these 
genes. We hypothesized that the T-cell functional branch-out 
differentiation would be a physiological system, allowing us to 
assess the potential role of nuclear architecture in cellular function. 

The interaction network of Ifng in Thl lymphocytes is distinct 
from that in Th2 and contains T-cell-specific genes 

For a comprehensive identification of the full spectrum of Ifhg 
contact loci, we performed the genome-wide chromosome con- 
formation capture on chip (4C) assay (Simonis et al. 2006) with 
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Figure 1. Cell type-specific interchromosomal association between functionally related genes in T lymphocytes. (A) 3D DNA FISH of Ifng and Furin in 
Thl and Th2 lymphocytes. Representative projections of images and cumulative frequency of 3D distance are shown. Green foci indicate Ifng; red foci, 
Furin; and blue, DAPI staining. Scale bar, 5 microns, (n) Number of cells analyzed for Thl (red curve) and Th2 (blue curve). (5) Similar analysis for the 114 
locus (green foci) and Gata3 (red foci). (C) Ifng 40. contact profiles at the Furin locus showing preferential contact in Thl cells. (D) 114 4C contact profiles at 
the Gata3 locus showing preferential contact in Th2 cells, (f) Thl -specific association of Ifng with Socs2, Il2rb, Traf3ip2, and ANKRD4 loci. Genomic 
coordinates in Mb (mm8). Thl in red and Th2 in blue throughout the figures. 
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Ifng as the bait. We initially focused on Ifhg because, as the only 
gene in the surrounding 300-kb region on the mouse chromo- 
some, it is expected to be a major determinant of the locus posi- 
tioning. Thus we reasoned that the contact landscape of Ifhg may 
permit a better delineation of the principles underlying Th cell- 
specific nuclear architecture (Misteli 2007). The Ifng 4C profile 
indeed reveals that Ifhg associates precisely with the Furin locus but 
not with a flanking region, which could not be ascertained with 
the resolution of DNA FISH (Fig. 1C). We further confirmed that 
contact profiles from our 4C analysis reflect quantitative associa- 
tion frequencies by 3D DNA FISH, observing high concordance 
between these assays (Fig. 1A-D; Supplemental Fig. SI). 

Inspection of the Ifhg interactome revealed a large number of 
Thl-specific genes and other immune pathway genes (e.g., Furin, 
Ifhgrl, Tnfaip3 [also known as A20], Socs2, Il2rb) that interact with 
Ifhg in Thl cells but not in Th2 cells (Figs. IE, 2; Table 1; Supple- 
mental Tables SI, S2). Indeed, a gene ontology analysis of all the 
genes on Ifng contacts from Thl showed an enrichment for genes 
involved in immune responses, signaling, and anti-microbial de- 
fense responses (Supplemental Fig. S2). For additional functional 
insight, Affymetrix microarray analysis was performed to monitor 
gene expression changes during Thl/Th2 differentiation and the 
response to TCR activation of the differentiated cells. Remarkably, 
many of these Ifhg contact genes were induced by TCR signaling in 
Thl (Furin, Dusp6, Fes [also known as Fps], Ppplrl6b [also known 
as Ankrd4], Arid3a [also known as BRIGHT]). Interestingly, some of 
the genes in Thl-specific contacts (Dusp6, Atp2bl) are TCR-responsive 
only in Thl, while not interacting with Ifhg and constitutively 
expressed in Th2. These Thl genes are silenced but poised to be 
activated specifically in Thl cells. These findings together indicate 
a functional aspect of the spatial association of these genes. 

The contact repertoires of Ifhg in the Thl and Th2 cell types 
are highly disjoint such that roughly half of the contacts are 
cell type-specific (Fig. 2B,C,F). The divergence is formed despite 
the fact that the majority of the contacts (90% and 93% in Thl 
and Th2, respectively) are on the cis chromosome. The inter- 
chromosomal interactions are also highly lineage-specific, with 
only 1% of trans contacts in Thl shared by Th2. The observed cell 
type specificity is robust despite the fact that the two Th cell types 
emerge from the naive T lymphocytes after differentiation for only 
1 wk. To understand the mechanism behind the specification of 
nuclear architecture during lineage commitment, we decided to 
look into the dynamics of the process and examined the original 
spatial environment of Ifhg in naive T cells. 

In naive T cells, transcriptionally inactive Ifng has promiscuous 
interactions with gene-rich regions 

Analysis of Ifng interactome in naive T cells revealed an excep- 
tionally large number of genomic contacts (12,201 positive probes 
[3.2% of all probes, comprising 419 genomic regions] from naive 
vs. 5291 [1.4%, 222 regions] and 6254 [1.6%, 256 regions] positive 
probes from Thl and Th2, respectively) (Fig. 2). In terms of genes 
present in the contact loci, the number of Ifhg contact genes de- 
creases from 1117 in the naive cells to 638 in Thl and 506 in Th2 
cells. Importantly, the vast majority of the Ifng contacts in the 
committed cells (88% and 65% of positive probes from Thl and 
Th2, respectively) are pre-established in the naive cells (Fig. 2F). A 
closer examination shows that the contact regions are often 
broad in naive cells, but are more localized in the differentiated 
cells (Fig. 2D). These data indicate that the interactome of Ifng 
undergoes a massive divergent selection during Thl/Th2 cell 



differentiation, primarily by elimination of promiscuous genomic 
associations in naive cells. 

Despite the predominant trend of reduced interactions, there 
were genomic loci that acquire Ifng contact specifically in Thl. 
Interestingly, these contacts harbor many Thl -relevant genes 
(Fig. 2B,D,E; Supplemental Tables S1-S3). Moreover, interactions 
with a subset of the genes that come into the subnuclear envi- 
ronment of Ifhg during Thl differentiation are lost instead in Th2- 
differentiated cells (Fig. 2B,C). In addition, some of the genes that 
lose contact with Ifhg during Thl differentiation increase interaction 
with Ifhg in Th2. These overlaps between reciprocally gained/lost 
gene contacts are far greater than expected by chance (P < 10~ 4 , 
Fisher exact test). These results suggest an exquisitely coordinated 
lineage specification of the nuclear architecture around the Ifhg locus. 

A stark contrast was observed for the Ifhg interactome when 
we performed the 4C analysis on 3T3-L1 mouse embryonic fibro- 
blasts, an unrelated cell type (Supplemental Fig. S3). In the fibro- 
blasts, the contact landscape of Ifhg is limited to gene-poor regions, 
despite a comparable number of contact loci (data not shown). 
Notably, only a negligible fraction of the abundant Ifng contacts in 
the naive T lymphocytes (5% of positive probes) is shared with 
3T3-L1. Comparison of fibroblasts and naive lymphocytes high- 
lights the observation that, even though the Ifhg gene is inactive in 
both cell types, an enormous contact repertoire exists only in the 
naive T cells and is poised to diverge into lineage-selective profiles. 

The Thl-selective nuclear architecture at Ifng requires STAT4 

The fate of naive CD4 + T lymphocytes is largely determined by the 
transcription factors acting downstream from the cytokines 
that promote differentiation into a specific lineage. STAT4 plays 
a crucial early role by mediating the signal from the Thl -inducing 
cytokine IL-12 and activating transcription of T-box expressed in 
T cells (T-bet). This constitutes a feed-forward regulatory circuit 
that promotes a fully mature Thl phenotype and IFNG secretion 
(Lighvani et al. 2001). Another member of the STAT family, STAT6, 
responds to IL-4 signaling instead and directs the differentiation 
toward the Th2 lineage. Given the importance and specificity of 
STAT4 for Thl differentiation, we examined its effects on nuclear 
architecture around Ifng. 

To determine whether the Thl interactome can develop 
without STAT4, we isolated naive CD4 + T cells from Stat4 knock- 
out (KO) mice, subjected them to the Thl differentiation protocol, 
and analyzed the Ifng contacts by 4C. The global reduction in Ifng 
interactions was severely hampered in the KO Thl cells, suggesting 
incomplete specification of Thl-selective organization (Fig. 3A-C). 
The absence of STAT4 affected both the gain (downward arrows) 
and the loss (upward arrows) of Ifhg interactions seen in wild- type 
(WT) cells (Fig. 3; Supplemental Fig. S4). Given that the TCR-me- 
diated molecular pathways are intact in the Stat4 KO T cells, it is 
noteworthy that the TCR signaling, common to all Th lineages, is 
not sufficient to generate the Ifng interactome in Thl cells. We 
conclude that the nuclear architecture in the Stat4 KO cells is 
promiscuous and less specialized, resembling the state of 
undifferentiated cells. 

The overall transcriptional activity of Ifng interactome is similar 
in Thl and Th2 despite the lineage-specific expression of Ifng 
and divergent contact selection 

The failure of the Sto£4-deficient cells to generate Thl-selective Ifng 
contact repertoire may result from the altered reprogramming of 
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Figure 2. Lineage-specific reorganization of the Ifng interactome. The genomic map of Ifng contacts in the naive (A), Thl (B), and Th2 (C) lymphocytes 
shows the locations of positive 4C probes (P-score > 4; see Methods) marked in a color scale with red corresponding to high contact frequency (log 2 ratio of 
4C/input DNA). (Gray) Locations of all other probes. (Arrowhead) Location of Ifng. The genes that reside in dramatically gained contact loci in Thl (B) or 
lost contact loci in Th2 (C) are listed. (D,£ ) Examples show selection from pre-existing contacts in naive cells as well as emergence of de novo contacts in 
the differentiated cells. (*) Retained contacts; (+) gained contacts. (F) The Ifng contact regions that are detected from each cell type. The Venn diagram 
areas are drawn in proportion to the numbers of probes in each category of overlaps. 



gene expression in the absence of STAT4. To understand the con- 
tact reorganization in terms of transcription, we first focused on 
the dynamic component of the Ifng interactome and looked for 
any correlation with changes in transcription. We identified the 
genomic loci that dramatically changed their interactions with Ifng 



during Thl differentiation, and analyzed the expression profiles of 
genes within these loci in naive, Thl, and Stat4 KO cells. The genes 
that come into contact with the activated Ifng in Thl were found to 
be more highly expressed than those that lose interaction (Fig. 4A, 
box plot). This result is mirrored by a reciprocal pattern in the Th2 
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Table 1. Genes within Thl -specific Ifng contact loci and their functional features 
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The expression information is from the Affymetrix microarray analysis (see Methods). Changes in expression by more than twofold are noted. The Ifng 
interaction information is based on the comparison of 4C profiles from naive, Thl, and Th2 cells. The complete lists of genes near Ifng contacts in naive, 
Thl, and Th2 cells are provided in Supplemental Tables SI, S2, S3. 



lineage, where Ifng is silenced (Fig. 4B, box plot). While the di- 
chotomy is striking, an absolute majority of lost or gained contact 
genes already have similar levels of expression in the naive cells 
(Fig. 4, heat maps). For example, the expression of genes that 
gained Ifng contact did not increase appreciably from naive to Thl. 
Likewise, the gain of Ifng contact in the Th2 lineage did not confer 
the genes de novo repression from naive to Th2. Importantly, the 
transcriptional regulation of these dynamic contact genes is largely 
intact in the absence of STAT4 (Fig. 4, heat maps, rightmost 



column). This rules out the possibility that the disrupted Ifng 
interactome from Stat4 KO is purely due to lack of gene regulation 
by STAT4. 

Unexpectedly, despite the lineage-specific repositioning of 
genes, the transcriptional activity of the resulting interactome of 
Ifng is not significantly different between Thl and Th2 (Fig. 4C). 
This is because the gene repositioning involved a relatively small 
number of genes and has a negligible contribution to the overall 
expression status of Ifng contacts. Thus, the simple concept of Ifng 





Figure 3. STAT4 knock-out T cells fail to develop Thl -specific Ifng interactome. (A,B) Examples of Ifng contacts in naive (green), Stat4 KO (pink), and 
Thl (red) cells in trans (A) and c/'s (6). Stat4 KO cells fail to induce (A) or eliminate (B) specific contacts. (Arrows) Contact loci that are disrupted in KO versus 
WT Thl . (C) The overlaps between contacts from naive, Thl , and Thl -induced Stat4 KO. The areas are drawn in proportion to the numbers of probes in 
each category of overlaps. 



466 Genome Research 



www.genome.org 



STAT binding shapes 3D genome architecture 



B 



Th1 



O 



Th2 



Gained Ifng 
contacts in 
Th1 (26 genes) 



Lost Ifng 
contacts in 
Th1 (54 genes) 




Gained Ifng f 
contacts in < 
Th2 (25 genes) [ 



Gained Lost 



Gained Lost 



All genes in contact 



w 12 



-9- 



Lost Ifng 
contacts in I 
Th2 (200 genes) 





Th1 Th2 

(638) (506) 
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genes on regions that specifically change Ifng interactions was examined in all cell types (see Methods). During Thl differentiation, Ifng is transcriptionally 
activated and increases interactions with a set of genes that are robustly expressed in all CD4 + T cells. On the other hand, the loci that lose contact harbor 
genes whose transcript levels are low throughout. (B) In Th2 cells, the transcriptionally silent Ifng gains numerous interactions with silenced or repressed 
genes, while losing contact with some highly expressed genes. Box plots show the expression values from the corresponding lineage, Thl (A) and Th2 (B). 
The heatmaps include expression data from the other cell types, and are sorted by the expression values in Thl (A) and Th2 (B) within each category of 
genes. (C)The expression of all the genes in contact with /fnginThl andTh2 are nonetheless similar. The number in parenthesis indicates the total number 
of genes in each set. 



residing in the transcriptionally active compartment in Thl and in 
the inactive compartment in Th2 is not valid, even though Ifng is 
active in Thl cells but inactive in Th2 cells. These observations 
indicate that expression alone cannot explain the lineage-specific 
reorganization of nuclear architecture and the phenotype defect in 
Stat4 KO T cells, raising the possibility that mechanisms other than 
expression may play a role. 

Changes in Ifng interactions occur at STAT4 binding sites 

Given the poor correlation between Ifng interaction dynamics and 
transcription, we then investigated whether STAT4 may affect 
nuclear architecture directly. We compared the Ifng 4C data with 
the STAT4 binding profile in Thl cells reported recently (Wei et al. 
2010). Interestingly, Thl contact regions are predominantly 
located at or near STAT4 binding sites (Fig. 5A,B; Supplemental Fig. 
S4). Moreover, loss of Ifng contact in Thl tends to coincide with 
sites that lack STAT4 binding (Fig. 5B, arrow). Vice versa, STAT4 
sites are concentrated preferentially on Ifng contacts in Thl, de- 
spite the overall reduction of the Ifng interactome from naive to 
Thl cells (Fig. 5C). Thl -specific spatial clustering of STAT4 sites is 
especially dramatic on the cis chromosome: 80% of all STAT4 
binding sites on chromosome 10 are located at contact loci in Thl 



cells, whereas only 35% of these sites are in contact with Ifng in 
Th2 cells. Taken together, these results demonstrate that the Ifng 
interactome developing during Thl differentiation is selectively 
enriched for STAT4 binding sites. 

For more direct and quantitative appraisal of the role of STAT4 
in the nuclear architecture dynamics, we examined the 4C contact 
profiles at sites bound by STAT4 in Thl cells. In addition to the 
enrichment of STAT4 binding sites at Ifng contact loci observed 
above, Ifng contact probability increases at 61% of all genomic 
STAT4 sites (4203/6894) (Fig. 5D, STAT4 binding sites plot). Re- 
markably, the increase is local and precisely at STAT4 binding sites, 
while the probability of contact with the flanking regions is lower 
in Thl (Fig. 5D, black arrows). The larger-scale profiles reveal that 
the global reduction in Ifng contact probability predominantly 
occurs in regions flanking these STAT4 sites in Thl, which suggests 
sharpened repositioning of interaction regions favoring STAT 
bound sites against the nonspecific backgrounds. Furthermore, the 
finer repositioning of interaction at STAT4 binding sites was no 
longer detectable in the Stat4 KO cells (Fig. 5D, dashed curves). The 
intriguing preference of I/hg interactions for binding sites is not found 
when we performed the same analysis on a control set of sites from 
a closely related transcription factor STAT6 that promotes Th2 
differentiation instead (Fig. 5D, STAT6 binding sites plot). 
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Figure 5. Thi -specific contacts are selected and strengthened at STAT4 binding sites. (A,B) Examples of STAT4 binding (black track) at Ifng contacts in 
naive (green), Stat4 KO (pink), and Thi (red) cells in trans (A) and c/'s (B). (C) The Ifng contact loci have the highest enrichment of STAT4 binding sites in 
Thi cells. The plot shows the numbers of Thi -derived STAT4 ChlP-seq sites per positive 4C probe for each 4C data. (**) P-value < 0.01 (permutation test, 
see Methods). (D) The average profile of Ifng 4C for each cell type centered at STAT4 binding sites. (***) The arrow represents a significant change (t-test 
P-value < 0.005). The average profile of Ifng 4C for each cell type centered at STAT6 binding sites (in Th2). 



On the other hand, the contact profiles of the remaining 39% 
of STAT4 sites deviate from the pattern described above. These sites 
are in contact with Ifng in the naive T cells but lose the interaction 
in Thi cells, consistent with the global trend from the naive to Thi 
(data not shown). It may be relevant to note the observations that 
lineage-promoting factors not only induce genes that enforce the 
cell identity but also actively repress genes specific for the other 
"forbidden" lineages (Usui et al. 2006; Djuretic et al. 2007; Zhuang 
et al. 2009; Yagi et al. 2010). An alternate plausible possibility is 
that these STAT4-bound genomic loci that dissociate from Ifng may 
gain new interactions with other Thi -specific loci. We note that 
the Thi -specific Ifng interactome contains many but not all Thi 
genes. For example, there is no detectable 4C signal at Hlx, Runx3, 
Twist 1, or Zbtb32 (data not shown). Thus it is conceivable that 
there may be more than one Thi -specific subnuclear hub that 
congregates STAT4 binding sites in Thi cells. Together these results 
strongly suggest that STAT4 binding to the genome contributes to 
the specification of the nuclear architecture around Ifng during 
Thi differentiation. 

Reciprocal specification of nuclear architecture around the 114 
locus takes place with Th2 differentiation 

To determine if a similar extent of reorganization happens for the 
Th2 lineage, we analyzed the spatial environment of 114, a sig- 
nature cytokine gene highly expressed by Th2 lymphocytes. 114 
manifested a distinct subnuclear environment, with little overlap 
with Ifng contacts as seen by the disjoint 4C profiles in all cell types 
(Supplemental Fig. S6B). Nevertheless, many properties found for 



Ifng were recapitulated in the 114 4C data (Supplemental Figs. S5, 
S6). The 114 interactome also shifts from promiscuous contacts in 
the naive cells to highly selective partners in Th2 cells (Supple- 
mental Fig. S5). Interestingly, despite the global trend, 114 gains 
physical interactions with some Th2-specific genes, including 
Gata3, Ikzfl, Gatad2a, and Nucbl, in Th2 cells but not in Thi cells. 
Ikzfl encodes Ikaros, a factor that silences T-bet during Th2 dif- 
ferentiation (Thomas et al. 2010). As observed from the Ifng 
interactome, the overall transcriptional activity of the 114 contact 
loci is similar in both cell types, despite their lineage-specific 
divergence (Supplemental Fig. S6C). Again, this finding is incon- 
sistent with the simple characterization of 114 interactome based 
on transcriptional activity. 

Mirroring the role of STAT4 in Thi differentiation, STAT6 
responds to the Th2-promoting cytokine IL-4 and trans-activates 
the signature transcription factor GATA3, enforcing the Th2 
lineage (Zhu et al. 2010; Kanno et al. 2011). When we performed 
the analogous analyses to assess the spatial distribution of STAT6 
binding sites in Th2 cells, 114 contact loci were only slightly 
enriched for STAT6 sites in Th2, which is not as pronounced as for 
the Ifng in Thi (Supplemental Fig. S6D). The more heterogeneous 
picture may emerge from the immune-irrelevant genes and their 
ubiquitous activity in the genomic context of 114 that also 
contribute to the genome-wide interactions of this locus (Morey 
et al. 2009). Nonetheless, naive T cells from Stat6 KO mice failed to 
develop a specialized set of 114 contact loci under the identical Th2 
differentiation condition (Supplemental Fig. S6E,F). We conclude 
that the absence of STAT6 disrupts the Th2-selective nuclear 
organization at 114 in response to Th2-inducing signals. 
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Discussion 

In this study, we show that the T-cell nuclear architecture un- 
dergoes a dramatic transition to support the functional speciali- 
zation of Th effector cells after pathogen encounter. Differentia- 
tion of primary CD4 + Th lymphocytes, where multiple lineages 
branch from the same type of cells, provided a superb model to 
dissect the functionally relevant changes in nuclear architecture. 
By comparison with observations from the Th2 lineage, we were 
able to delineate Thl -specific features of the contact landscape for 
Ifng. In the naive progenitor cells, genomic interactions of the in- 
active Ifhg locus are surprisingly promiscuous, reflecting a differ- 
entiation potential for several permissible lineages. Most of these 
abundant contacts are lost as lymphocytes commit to the Thl 
lineage, giving rise to a highly selective subnuclear environment 
for Ifhg. This is reminiscent of a similar pattern observed for ex- 
pression: Pluripotent stem cells switch from a broad spectrum of 
expressed genes to a more restricted program during differentia- 
tion, shutting down unnecessary genes while maintaining those 
relevant for cell function (Terskikh et al. 2003; Efroni et al. 2008). 
Some genes gain contact specifically in Thl cells and, interestingly, 
are important for Thl function. Many of these strengthened Thl 
contacts are eliminated from the neighborhood of Ifhg in the Th2 
lineage, further highlighting the exquisite cell type specificity of 
their interactions. 

We observe a lack of correlation between nuclear reorga- 
nization and transcriptional activity during Th differentiation. 
Monitoring the dynamics of nuclear organization and the tran- 
scriptional program in parallel allowed us to assess the potential 
role of transcription in driving lineage-specific contact changes. 
However, as naive cells differentiate to Thl cells where Ifhg be- 
comes active, the locus does not transition into a highly active 
compartment. Similarly, the interactome of Th2-activated 114 gene 
has minimal changes in global expression status before and after 
differentiation. Moreover, the dynamic contact genes themselves 
do not exhibit a coordinately regulated expression pattern that 
correlates with gain or loss of interaction with the cytokine locus. 
Perhaps the most striking finding that violates the transcription- 
based model of nuclear organization is that the massive gene-rich 
repertoire of interactions with Ifng exists in naive cells but not 
in fibroblasts, even though Ifng is silenced in both cell types. 
Furthermore, Ifng 4C was not correlated with H3K4me3 and 
H3K27me3 marks published previously (Wei et al. 2009; data 
not shown). Consistent with this finding, perturbation of post- 
translational histone modifications was reported to have no im- 
pact in shaping higher-order structures (Splinter et al. 2011; Nora 
et al. 2012), despite the apparent correlation of their profiles with 
genome conformation (Lieberman-Aiden et al. 2009; Dixon et al. 
2012). These results support that a simple dichotomy based on 
active and inactive nuclear spatial compartments does not explain 
the functional reorganization during Th differentiation, pointing 
to additional mechanisms that underlie the cell type-specific 
transition of nuclear architecture. 

We present genetic and molecular evidence that STAT pro- 
teins play a critical role in establishing the functional nuclear ar- 
chitecture of effector Th cells. In the absence of STATs, the highly 
selective interactome fails to develop. This may be partly due to 
the abolished transcription of lineage-enforcing genes that are 
normally activated by STATs. Along this line, a tissue-specific 
transcription factor was shown to be important for the nuclear 
organization in erythrocytes (Schoenf elder et al. 2009). But ques- 
tions still remain whether the factor acts directly or via regulation 



of gene expression. Different from studies that focused on post- 
differentiation state, our dynamics-conscious analysis has allowed 
us to identify, for the first time, contact partner selection as the 
predominant mechanism behind the formation of a mature 
architecture during terminal differentiation. An important novel 
insight is that transcription factor binding (STAT binding to 
chromatin), rather than transcriptional activity, is a critical factor 
in shaping the spatial configuration of the genome. STAT4 seems 
to have a direct role in functional organization of the nuclear 
architecture, as indicated by the strengthened interactions that 
occur precisely at its binding sites in Thl. We show that gene 
expression status is not likely to drive the establishment of nuclear 
architecture, despite the confounding correlation with the inter- 
actome. Our study raises a new paradigm with a potential role for 
transcription factors as nuclear organizers. Undoubtedly, future 
studies will need to address the relative contributions of numerous 
chromatin binding factors to the various hierarchical levels of 
genome organization (Dixon et al. 2012; Lin et al. 2012; Sexton 
et al. 2012). 

What could be functional implications from a nuclear orga- 
nization driven by binding of chromatin regulators? One possi- 
bility is that such a nuclear architecture may bring together a set of 
genes that may not necessarily be transcribed immediately but 
possess the potential to be regulated in response to relevant stim- 
uli. Consistent with this model, many TCR-responsive genes 
associate with Ifng in the mature cells before transcriptional 
activation by TCR stimulation. Similar association has recently 
been observed for hormone-responsive genes before transcrip- 
tional activation (Hakim et al. 2011). STAT proteins, like many 
transcription factors, bind to numerous intergenic regions at 
considerable distances from any known gene (Wei et al. 2010). The 
marked spatial clustering of STAT4 binding sites in Thl cells 
supports a high local concentration of STAT4 proteins within 
a close 3D neighborhood of Ifng in the nucleus, potentially facili- 
tating transcriptional regulation (Hakim et al. 2010, 2011; Wang 
et al. 2011). Recent reports have implicated the presence of 
regulatory elements, observed as DNase I hypersensitive sites 
(DHSs), as the most prominent genomic feature within physical 
DNA interaction regions (Lieberman-Aiden et al. 2009; Hakim 
et al. 2011; Yaffe and Tanay 2011). In the context of Th cell dif- 
ferentiation, we identify STAT4 as a chromatin factor that binds to 
such regulatory elements that affect nuclear architecture. STAT4 
may be one of several factors that act on the regulatory elements, 
because cell type-specific regulatory sites can be visited by multiple 
transcription factors in a combinatorial and cooperative fashion 
(e.g. Ram et al. 2011; Rao et al. 2011; Siersbaek et al. 2011; 
Voss et al. 2011). Indeed, some STAT4 binding sites are also bound 
by T-bet in Thl (Sekimata et al. 2009; Nakayamada et al. 2011). 
Structural proteins like CTCF and cohesin are also involved in the 
long-range association of Ifng with nearby regulatory loci (Hadjur 
et al. 2009; Sekimata et al. 2009). The contribution of these factors 
in the establishment of large-scale genome folding remains to be 
explored, but the fact that many STAT4 binding sites are 
inaccessible in naive cells (Nakayamada et al. 2011) suggests 
that STAT4 activation may be an important pioneering event in 
Thl-specific genome organization. Taken together, these results 
support that STAT proteins are a major determinant of lineage- 
specific higher-order structure of the genome. 

Our findings elucidate a STAT4-mediated mechanism critical 
in the formation of a functional nuclear architecture at the Ifng 
locus. The reorganized interactome represents an important 
physical mark of committed cells that is hard to reverse, which in 
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turn contributes to the maintenance of the distinct cellular iden- 
tity. These data provide a basis for understanding how lineage- 
specific transcription factors not only regulate gene expression 
but also play a prominent role in shaping a nuclear architecture 
that supports specialized effector function in cellular differentia- 
tion (Fig. 6). 

Methods 

Mice, cell isolation, culture, differentiation 

C57BL/6J mice and Stat6 KO mice were purchased from Jackson 
Laboratory. Stat4 KO mice were provided by Dr. M. Kaplan. All 
mice were handled and housed in accordance with the guidelines 
of the National Institutes of Health Animal Care and Use Com- 
mittee. Splenic and lymph node T cells were obtained by disrupt- 
ing organs of 8- to 12-wk-old mice. All cell cultures were performed 
in RPMI supplemented with 10% fetal calf serum, 2 mM gluta- 
mine, 100 IU/mL penicillin, 0.1 mg/mL streptomycin, 10 mM 
Hepes, 1 mM sodium pyruvate, and 2.5 |xM (3-mercaptoethanol. 
T cells were enriched with a CD4 + T-cell kit and AutoMacs 
isolator (Miltenyi Biotec). Naive CD4 + T cells were isolated by 
flow cytometry, staining with CD4, CD62L, CD44, and CD25 
antibodies. Naive T cells were first cultured in the presence of plate- 
bound anti-CD3 and anti-CD28 (10 |xg/mL each) under Thl (IL-12 
[10 ng/mL] and anti-IL-4 [10 |xg/mL]) or Th2 (IL-4 [10 ng/mL], anti- 
IFNG [10 fx?g/mL]) conditions for 3 d and then cultured further 
in IL-2 (50 U/mL) in combination with IL-12 (Thl) or IL-4 (Th2) 
and grown an additional 4 d. Cytokines were purchased from 
R&D Systems. Anti-CD3 and anti-CD28 were from eBioscience 
and other antibodies were from BD Biosciences. 
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Figure 6. Selection of contact loci during Thl differentiation favoring 
spatial clustering of STAT4 binding sites. A model depicts the transition of 
the subnuclear interactome of Ifng during Thl differentiation. In the naive 
progenitor cells, Ifng is silenced and has promiscuous contacts. Thl- 
inducing cytokines activate STAT4, and the transcription factor binds to 
regulatory elements in the genome. Ifng interactome is reorganized pre- 
dominantly by retaining and strengthening contacts that harbor STAT4 
binding sites. The resulting subnuclear environment of Ifng in Thl supports 
a high local concentration of STAT4 and other cooperating factors. 



3D DNA FISH analysis 

Cultured T lymphocytes were set on a poly-L-lysine-coated cover 
slip (BD Biosciences) by centrifugation at 1000 RPM for 5 min. We 
followed our previously described 3D FISH protocol after fixation 
in 4% PFA for 10 min (Hakim et al. 2011). Probes were prepared 
by labeling bacterial artificial chromosomes (BACs) with biotin 
(Roche; Biotin-Nick translation mix) or digoxigenin (Roche; 
DIG-Nick translation mix). For each BAC, single colonies were 
grown and verified by PCR. Biotin-labeled probes were detected 
by Fluorescein Avidin (Vector), and digoxigenin-labeled probes 
were detected with anti-Dig- Rhodamin (Roche) diluted 1:200. 

BAC probes were used as follows: Ifng, RP24-368M14; Furin, 
RP23-377F13; 114, RP23-449L19; and Gata3, RP24-402N11. BACs 
used for 4C validation (Supplemental Figs. SI and S6) are as fol- 
lows: RP23-428D6, RP24-305D11, RP23-13J24, RP24-239H10, 
RP23-223C18, and RP24-319B9. Image acquisition and analysis 
was performed according to the method previously described 
(Hakim et al. 201 1). Briefly, to determine 3D distances, images were 
analyzed by a custom automated computer algorithm. This algo- 
rithm assigns the 3D spatial position of each locus at the center 
(brightest pixel) of the FISH focus and then calculates the distances 
between all the possible pairs from the different color channels. 
The algorithm generates final outputs from the closest pair in each 
nucleus. Cumulative distributions of FISH distances up to 1 micron 
are presented in Figure 1 and Supplemental Figures SI and S6, since 
4C contact frequency is highly correlated with FISH in this 
distance range (Hakim et al. 2011). 

Chromosome conformation capture on chip 

The 4C assay was performed according to the method previously 
described (Simonis et al. 2006; Hakim et al. 2011). 3C ligation 
junctions DNA library, reflecting in vivo spatial proximity, was 
generated with Hindlll (New England Biolabs). For circularization, 
the ligation junctions were digested with Csp6I (Fermentas). Baits 
were amplified with inverse PCR primers (shown from 5' to 3') 
as follows: Ifng, Ifng_4C2F aagccagtttgtcatcatgcand Ifng_4C2R 
tggcacgcttaatgatcaaa; 114, I14_4C_C ggtctcaacccccagctagt and 
I14_4C_H ccttgaactcagactcacaagc. Each experiment was indepen- 
dently performed twice. 



Statistical analyses of 4C 

The primary analysis of 4C microarray data was performed 
according to the method previously described (Hakim et al. 2011). 
Briefly, 4C signal for each condition was taken to be the average of 
the quantile-normalized 4C/input log ratios from biological du- 
plicates. Statistically significant enrichment of high signal probes 
was assessed by a x test on the 100-kb running window centered at 
the probe. 4C probes were considered "positive" if the x 2 P-score 
(-log 10 P) is greater than 4. In the Venn diagrams, positive 4C 
probes were used for counting. A genomic map of contacts presents 
the positive 4C probes, marked in a color scale based on log 2 4C/ 
input DNA. Contiguous probe-clusters in Supplemental Tables SI 
through S3 were obtained by merging positive 4C probes within 
100 kb on the chromosome. In all the genome browser shots, en- 
richment of 4C signal is presented by P-score profiles. The genes on 
contact regions were obtained by retrieving those whose tran- 
scription start site (from UC Santa Cruz "RefSeq Genes" annota- 
tion table at http://genome.ucsc.edu/cgi-bin/hgTables) is within 
10 kb of positive probes. 

The probes for the genomic loci that dramatically change 
interactions with Ifng during Th differentiation in a lineage- 
specific manner were identified using the following criteria: 
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• Gained contacts in Thl: Thl P-score - naive P-score > 6, Th2 
P-score < 2, and 3T3 L-l P-score < 2; 89 probes total. 

• Lost contacts in Thl : naive P-score - Thl P-score > 6, | Th2 P-score - 
naive P-score | < 3; 578 probes total. 

• Gained contacts in Th2: Th2 P-score - naive P-score > 6, Thl 
P-score < 2, and 3T3 L-l P-score < 2; 230 probes total. 

• Lost contacts in Th2: naive P-score -Th2 P-score > 6, | Thl 
P-score - naive P-score | < 3; 882 probes total. 

The genes in contact loci (Fig. 4) were obtained by retrieving 
those whose transcriptional unit (5' start to 3' end) overlaps with 
or lies within 10 kb of a 4C probe. 

To compare raw 4C profiles from different samples (Supple- 
mental Fig. SI), we obtained a fixed resolution profile of each 
data by computing the average 4C log ratio over a window of size 
100 kb, sliding along the genome at 50-kb increments. 



Affymetrix expression microarray analysis 

Total cellular RNA from naive, Thl, and Th2 cells was extracted 
with mirVana kit (Applied Biosystems/Ambion) in accordance 
with the manufacturer's instructions. Approximately 10 u,g of RNA 
was labeled and hybridized to GeneChip Mouse Genome 430 2.0 
arrays (Affymetrix) in accordance with the manufacturer's pro- 
tocols. Raw data in CEL files were preprocessed using Affymetrix 
Expression Console software with "RMA" and its quantile nor- 
malization method. We profiled expression of naive, Thl, and 
Th2 cells without TCR stimulation, and incorporated previous 
data for TCR-stimulated cells (Wei et al. 2010) by quantile- 
normalizing the combined set. In Figure 4 heat maps, we show 
TCR-unstimulated expression values of all the Affymetrix probe 
sets that correspond to the relevant genes, and therefore, the 
number of rows is greater than the number of nonredundant 
genes, since some genes are represented by multiple probe sets on 
the microarray. 

Integration of ChlP-seq data 

For the STAT4 binding profile in Thl cells, STAT4 ChlP-seq raw 
data were used from a previous study (Wei et al. 2010). Unmerged 
FDR 0% ChIP hotspots were computed from both WT and STAT4 
KO control ChlP-seq 25-mer read data (Baek et al. 2012). After re- 
moving regions overlapping the nonspecific STAT4 KO ChIP 
hotspots, WT STAT4 hotspots were further filtered by selecting 
those whose tag density is above the median value. This resulted 
in 8868 STAT4 hotspots that represent robust binding sites. For 
comparison, STAT6 binding data from the same study was treated 
analogously to produce 5803 STAT6 hotspots. Figure 5C was gen- 
erated by the following procedure: First, all STAT4 hotspots that 
overlap with or lie within 10 kb of positive 4C probes were re- 
trieved. The normalized number of STAT4 sites was calculated by 
dividing the total number of hotspots found in the previous step 
by the number of positive 4C probes (excluding the 10-Mb bait 
region). For Figure 5D, we performed the following steps. First, all 
the STAT4 binding sites that are covered by the NimbleGen probes 
(hotspots within 10 kb of any NimbleGen 4C probe) were included 
for analysis. We excluded sites on the 10-Mb-wide bait region 
surrounding the Ifng gene where the 4C signal is nearly maximal. 
Then STAT4 sites were further selected by requiring that the mean 
of the middle three bins (at the center of the STAT4 hotspot) increases 
from the naive to Thl profile. For each of the selected STAT4 site, 4C 
log ratios were computed with the resolution of 10-kb bins that span 
the 200-kb genomic region centered at the STAT4 hotspot. Shown 
in the plots are the average profiles over these STAT4 hotspots. The 
same procedure was applied to STAT6 binding sites as a control. 



Data access 

All the data have been submitted to the NCBI Gene Expression 
Omnibus (GEO) (http://www.ncbi.nlm.nih.gov/geo/) under ac- 
cession no. GSE38717. 
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